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^ ' Abstract 

I ■ A discussion is presented of the manner in which uncertainties in parton dis- 

^ \ tributions and related quantities are determined. One of the central problems 

is the criteria used to judge what variation of the parameters describing a set 
<^ • of partons is acceptable within the context of a global fit. Various ways of 

C<^ ! addressing this question are outlined. 

cn ■ 
(N ; 

lO ' 1. Introduction 

o . 

^ , The procedure of determining parton distributions by so-called global fits to data, mainly structure func- 

O \ tions, is long established However, it is a rather more recent development to try to determine the 

^ ' errors on these distributions at the same time. This has come about for a number of reasons. Firstly, the 

sheer amount of data (full references in [p^) sensitive to various parton distributions, and the precision 
of this data, has become such that an accurate determination of all parton distributions is possible (with 
some problems only in difficult to reach regions of phase space, e.g. x very near to 1). Secondly, the 
understanding of the experimental errors on this data has reached a new level of sophistication, with the 
. ^ , systematic errors being understood far better in terms of their separate sources and correlations. Lastly, 

^ ' the theoretical understanding at NLO in as has improved so that subtleties due to e.g. heavy quarks are 

■ now understood. 

There are many issues in the determination of errors on parton distributions, and a discussion 



of these may be found in [10]. However, one of the main outstanding problems, and the focus of a 



discussion session at this meeting, is the manner in which one determines precisely the size of the errors. 
2. Quality of fit 

The quality of the fit to a set of data is generally presented in terms of the x^. Taking fully into account the 
correlated systematic errors this may be calculated by the covariance matrix method, i.e. the covariance 
matrix is constructed as 

n 

dj = Sijaf gtat + ^ Pij(Tk,iCrk,j, (1) 
fc=i 

where k runs over each source of correlated systematic error and are the correlation coefficients. The 
is then defined by 
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X- = E E(A - T,{a))Cr^\D, - T,{a)), (2) 
i=i j=i 

where N is the number of data points, Di is the measurement and Ti{a) is the theoretical prediction 
depending on parton input parameters a. Alternatively, one can incorporate the systematic errors into the 



theory prediction 



fi{a,s) = Ti{a) + ^ SfcA 
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where Aj^ is the one-sigma correlated error for point i from source k. The is then defined by 
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where the second term constrains the values of s^, assuming that the coiTcIated systematic errors are 
Gaussian distributed. This is identical to the correlation matrix definition of if the errors are small. 
In many cases the statistical and systematic errors are simply added in quadrature, either for simplicity, 
because this has much the same result as the above procedures, or due to lack of information on the 
correlations of systematic errors. (The sophisticated treatment is found to be essential for Tevatron jets 
where correlated systematic errors dominate.) 

All groups performing global fits use some combination of the above ways to calculate x^> and 
minimizing with respect to the defined completely determines the parton distributions. In general, the 
quality of the total fit is reasonably good, e.g. for CTEQ6 one obtains x^/d.o.f. = 1954/1811 and for 
MRST2001 x^/d.o.f. = 2328/2097. 
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Fig. 1: The quality of the fit to the individual data sets included in the global analysis by MRST, shown together with the grand 
total a function of as{M%) 

In principle, the one-u error on some parameter in the fit, e.g. the value of 05 (M|) or one of the 
parameters describing the input parton distributions, can be determined by allowing the value of x^ to 
vary one unit from its minimum. In practice, this results in unrealistically small errors. This is shown in 
Fig. 1, which shows the variation of the x^ with asiM"^) for the total fit, and for individual sets within 
the global fit for the MRST partons. 



It is also demonstrated in an alternative manner in Fig. 2, where for the CTEQ6 fit the extraction 
of as{M^) obtained by allowing to increase by one unit for each data set within the fit is presented 
[pl|]. Clearly the size of the errors and the scatter of the points are unrealistic. 
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Fig. 2: Values of as{Mz) obtained by CTEQ from the quality of fit to individual data sets within the global fit. 

This failure of the Ax^ = 1 rule may have various sources. Firstly, it is clear from examination 
that not all of the data sets are consistent with each other, i.e. the errors have been underestimated in 
some manner. Thus fitting the data simultaneously leads to a certain degree of inconsistency. One could 
keep only those sets which are clearly consistent with each other (within the context of NLO QCD) [^. 
However, this leads to a severe restriction in the amount of data used, and the partons in some regions 
would lose important constraints. (Due to the correlation between different regions of x via sum rules and 
because evolution equations for partons involve convolutions over a range of x, problems in one range 
of X impact on the partons over the whole range.) A second problem is that certain assumptions have to 
be made when performing a global fit. These include the cuts made on data, data sets included in the fit, 
the parameterization for input parton sets, the form of strange sea, the precise heavy flavour prescription, 
the definition of the NLO strong coupling constant, the starting scale of evolution etc. Many of these 
can lead to variations considerably greater than those obtained from Ax^ = 1. Finally, we do not have 
a perfect theory - NLO-in-ag QCD has many corrections. Obviously there are higher orders (NNLO), 
which are not that small in themselves in QCD, but there are other problems. In particular at large and 
small X there are large logarithms associated with higher orders, i.e. at small x there are terms of the 



type a" ln"~ (I/2;), and at large x terms like In "~ (1 — x). These are not fully understood. Also 
at low Q"^ there are higher twist, i.e. 0{l/Q'^) corrections that ai^e again not well understood. Hence, in 
fitting data one parameter may be artificially well constrained at some incorrect value in some region of 
parameter space by the necessity to account for missing corrections to the theory. This can then influence 
other regions in the manner explained above, e.g. an artificially large gluon in one range of x can lead to 
it being too small in other regions because of the momentum sum rule constraint. 

This has led to various methods to determine errors by allowing a wider variation than the canon- 
ical Ax^ = 1. CTEQ actually allow Ax^ = 100 by various considerations, including error limits of 
individual experiments [12]. An example is the determination of asiM^) as shown in Fig. 3, where 



the error bars give the 90% confidence limit for each experiment (the size of which varies considerably 
between data sets), and the two lines give the band of 05 (M|) allowed by Ax^ = 100 for the global 



fit [11]. Arguably this is too conservative, and CTEQ regard it as more than a one-cr error. MRST use 



= 20 as an approximate one-u error instead, based on the essentially subjective criterion of judging 
when the fit is going uncomfortably wrong in some region ||^. 
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Fig. 3: Allowed value of asiM^) obtained by CTEQ from the quality of fit to individual data sets within global fit. 

A more precisely defined method is to use an alternative formulation for x^- In the offset method 
the fit is obtained by minimizing 

2 
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fi{a,s)) 



(5) 



i.e. the best fit and parameters are obtained from only uncorrected errors. The quality of the fit can 
then be estimated by adding errors in quadrature. The systematic errors on are determined by letting 
each Sfc = ibf and adding the deviations in quadrature. This is essentially the same as, and in practice 
achieved by calculating 2 Hessian matrices 



daidaj 
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daidsj ' 



(6) 



and defining covariance matrices 



M- 



M-WV^M-' Ctot = Cstat + Cs 



^stat = .^yj^ '^sys^J-Vi vv ivi ^tot ^ ^stat ^sys- (7) 

This method was used in early HI fits |[l|] and in ZEUS fits ||[ ^. Much the same method is used for 
polarized data in ||8]], as far as the information on correlated errors allows. Strictly speaking, the method 



is not optimum, and it leads to a much bigger uncertainty than the standard method for Ax = 1 [14] 



However, it may be suitable in practice [Q], and ZEUS estimate the same results could be obtained using 
the standard method and Ax^ = 50. 

The results of the various approaches may be summarized in a selection of the extracted values of 
a5(M|) below |1T|]. 
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0.1165 ±0.0065 

0.1166 ± 0.0008(uncor) ± 0.0032(corr) 
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The values obtained are consistent, and the errors not too dissimilar given the wide variation 
in Ax^ used. This is largely because each group has chosen a method which gives a reasonable and 
believable error. HI actually use the standard definition, obtaining a small error, but are only able to 
do this because they use a much smaller data set than the other groups, i.e. their own data plus some 
BCDMS data which are fully consistent. If MRST and CTEQ were to use the same approach their error 
would be tiny (and their exti^acted value of a5'(M|) some standard deviations away from that of HI, e.g. 
MRST2001 would obtain as(M|) = 0.1192 ± 0.0005). 



3. Conclusions 

In determining errors for and from parton distributions one encounters a dilemma. One could fit to a 
relatively small subset of the available data with a very conservative set of cuts so that data are self 
consistent and theory is truly reliable. However, this would lead to a poor determination of partons in 
certain regions and would give no idea how well the theory is working overall. Alternatively, one can try 
to constrain the partons in as wide a region of parameter space as possible, accepting that the standard 
rules for error determination will break down. The way in which one can obtain sensible estimates of the 
real errors in the latter case is a subject very much open for discussion. 



Acknowledgements 

We would like to thank some of our colleagues not at the discussion, A.D. Martin, J. Pumplin, R.G. 
Roberts and W.K. Tung for useful comments and the workshop organizers L.Lyons and M. Whalley. 

References 

[I] M. Gluck, E. Reya and A. Vogt, Eur. Phys. J. C5 (1998) 461. 

[2] A.D. Martin, R.G. Roberts, W.J. Stirhng and R.S. Thome, Eur. Phys. J. C23 (2002) 73. 



[3] CTEQ Collaboration: J. Pumplin et al, hep-ph/0201195 



[4] HI Collab: C. Adloff et al, Eur. Phys. J. C21 (2001) 33; 

B. Reisert, presented at DIS2002, Krakow, Poland, May 2002. 

[5] S. I. Alekhin, Phys. Rev. D63 (2001) 094022. 

[6] M. Botje, Eur. Phys. J. C14 (2000) 285. 



[7] A. M. Cooper-Sarkar, these proceeding hep-ph/0205153 



ZEUS Collab: S. Chekanov et al., in preparation. 



[8] J. Blumlein and H. Bottcher, hep-ph/0203155 



[9] W. T. Giele and S. Keller, Phys. Rev. D58 (1998) 094023; 



W. T. Giele, S. Keller and D. A. Kosower, |hep-ph/0104052 . 
[10] R.S. Thorne, these proceedings. 

[11] W.K. Tung, talk presented at DIS2002, Krakow, Poland, May 2002. 
[12] D. Stump et al, Phys. Rev. D65 (2002) 014012. 
[13] C. Pascaud and F. Zomer 1995 Preprint LAL-95-05. 
[14] S. I. Alekhin, |hep-ex/00 05042| . 



[15] A. M. Cooper-Sarkar, V. Shekelyan and R. S. Thorne, Structure Functions summary talk presented 
at DIS2002, Krakow (May 2002). 



